Holistic Approach for Urdu Character Recognition Using Modified HMM
Abstract
Automatic recognition of cursive handwritten script remains a challenging problem despite promising improvements in classifiers and computational power. Segmentation-based approaches to recognizing handwritten Urdu script carry considerable computational overhead and achieve lower accuracy than for Roman and Chinese scripts because of the additional segmentation error. The presence of complementary characters in Urdu complicates matters further, as they have to be segmented into secondary strokes that are associated with a primary stroke. The first phase introduces a ligature-based approach using a Hidden Markov Model (HMM) for recognition of Urdu script. The HMM database is divided into 54 subclasses based on the starting and ending shapes of the ligature. Twenty-six time-variant features have been selected for the base strokes. The subdivision into classes reduces the time complexity and increases efficiency. The second part of this chapter presents a segmentation-free approach for recognition of online Urdu handwritten script using a hybrid classifier that combines HMM and fuzzy logic. The trained data set, consisting of HMMs for each stroke, is classified into 62 sub
Similar resources
The Optical Character Recognition for Cursive Script Using HMM: A Review
Automatic Character Recognition has a wide variety of applications, such as automatic postal mail sorting, number plate recognition, automatic form reading, and text entry on PDAs. Automatic character recognition of cursive scripts is a complex process that faces issues not found in other scripts. Many solutions have been proposed in the literature to solve complexities of cursive scripts...
Recognition of Urdu Character with HMM Technique
This paper deals with an Optical Character Recognition system for printed Urdu, a popular Pakistani/Indian script and the third most widely understood language in the world, especially in the subcontinent, yet few efforts have been made to make it understandable to computers. A lot of work has been done in literature and Islamic studies in Urdu, which has to be computerized. Research h...
Multi-font Numerals Recognition for Urdu Script based Languages
Handwritten character recognition of Urdu script based languages is one of the most difficult tasks due to the complexities of the script. Urdu script based languages have not received much attention, even though this script is used by more than 1/6th of the population. The complexities of the script make the recognition process more complicated. The problem in handwritten numeral recognition is the shape si...
Lexicon Reduction for Urdu/Arabic Script Based Character Recognition: A Multilingual OCR
Arabic script character recognition is a challenging task due to the complexity of the script and the huge number of ligatures. We present a method for the development of multilingual Arabic script OCR (Optical Character Recognition) and lexicon reduction for Arabic script and its derivative languages. The objective of the proposed method is to overcome the large dataset of Urdu and similar scripts by using...
Optical Character Recognition System for Urdu Words in Nastaliq Font
Optical Character Recognition (OCR) has been an attractive research area for the last three decades, and mature OCR systems reporting recognition rates close to 100% are available for many scripts/languages today. Despite these developments, research on recognition of text in many languages is still in its early days, Urdu being one of them. The limited existing literature on Urdu OCR is either l...